Combining Independent Knowledge Sources for Word Sense Disambiguation

Authors

Yorick Wilks

Mark Stevenson

Abstract

Department of Computer Science, University of Sheffield, Regent Court, 211 Portobello Street, Sheffield S1 4DP, UK

Sense tagging, the automatic assignment of the appropriate sense from some lexicon to each of the words in a text, is a specialised instance of the general problem of word sense disambiguation. We discuss which recent word sense disambiguation algorithms are appropriate for sense tagging. It is our belief that sense tagging can be carried out effectively by combining several simple, independent methods, and we include the design of such a tagger. A prototype of this system has been implemented, correctly tagging 88% of polysemous word tokens in a small test set, providing evidence that our hypothesis is correct.

1 Sense Tagging

There has been a tendency in the word sense disambiguation (WSD hereafter) literature to carry out different disambiguation tasks and classify them all as WSD procedures. There are, however, at least three different levels of algorithm in WSD. The most specific of these are sense tagging procedures, which assign to each word¹ in a text its particular sense from some lexicon, with each word type having a set of senses specific to it. This differs from the more general case of semantic tagging, where the tags for each word (type) need not be specific to that type and do not correspond to word senses in a lexicon. These tags may be broad semantic categories such as HUMAN or ANIMATE, or WordNet synsets, which may apply to any word in the text (similar to part-of-speech tagging, in the sense that there is a class of tags which may apply to any token in the text).

The most general class is that of semantic disambiguation algorithms. These are procedures which carry out semantic disambiguation on words; they are not necessarily tagging algorithms, in that they do not attempt to mark every token in a text but may be restricted to disambiguating small sets of word types. The class of sense tagging algorithms is a proper subset of the class of semantic tagging algorithms, which is, in turn, a proper subset of the class of semantic disambiguation algorithms. This hierarchical relationship is represented in Figure 1.

¹ This is often loosened to each content word.

[Figure 1: Sense Tagging nested within Semantic Tagging, nested within Semantic Disambiguation.]
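The difference between a sense tag and a semantic tag can be made concrete with a small sketch. The lexicon entries, sense identifiers, and function names below are hypothetical and chosen only for illustration; they are not taken from the tagger described in the paper. The point is simply that a sense tag is valid only for the word type that owns it, whereas a semantic tag comes from one shared set that may apply to any token.

```python
from typing import Dict, List, Set

# Hypothetical toy lexicon: each word type has its own, type-specific senses.
SENSE_LEXICON: Dict[str, List[str]] = {
    "bank": ["bank_1", "bank_2"],    # e.g. financial institution / river side
    "plant": ["plant_1", "plant_2"],  # e.g. living organism / factory
}

# Hypothetical shared tag set: broad categories that may apply to any word.
SEMANTIC_TAGS: Set[str] = {"HUMAN", "ANIMATE", "INSTITUTION", "LOCATION"}


def is_valid_sense_tag(word: str, tag: str) -> bool:
    """A sense tag is valid only if it belongs to this word type's inventory."""
    return tag in SENSE_LEXICON.get(word, [])


def is_valid_semantic_tag(word: str, tag: str) -> bool:
    """A semantic tag is valid for any word, as long as it is in the shared set."""
    return tag in SEMANTIC_TAGS


if __name__ == "__main__":
    print(is_valid_sense_tag("bank", "bank_2"))        # sense owned by "bank"
    print(is_valid_sense_tag("bank", "plant_2"))       # sense owned by another type
    print(is_valid_semantic_tag("bank", "LOCATION"))   # shared tag, any word
    print(is_valid_semantic_tag("plant", "LOCATION"))  # same tag, different word
```

Under these assumptions, the same semantic tag can be checked against any word, while a sense identifier is rejected for every word type other than its own.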


Similar resources

Word Sense Disambiguation using Optimised Combinations of Knowledge Sources

Word sense disambiguation algorithms, with few exceptions, have made use of only one lexical knowledge source. We describe a system which performs unrestricted word sense disambiguation (on all content words in free text) by combining different knowledge sources: semantic preferences, dictionary definitions and subject/domain codes along with part-of-speech tags. The usefulness of these sources...

Combining Weak Knowledge Sources for Sense Disambiguation

There has been a tradition of combining different knowledge sources in Artificial Intelligence research. We apply this methodology to word sense disambiguation (WSD), a long-standing problem in Computational Linguistics. We report on an implemented sense tagger which uses a machine readable dictionary to provide both a set of senses and associated forms of information on which to base disambigu...

Combining Knowledge- and Corpus-based Word-Sense-Disambiguation Methods

In this paper we concentrate on the resolution of the lexical ambiguity that arises when a given word has several different meanings. This specific task is commonly referred to as word sense disambiguation (WSD). The task of WSD consists of assigning the correct sense to words using an electronic dictionary as the source of word definitions. We present two WSD methods based on two main methodol...

Combining Supervised and Unsupervised Lexical Knowledge Methods for Word Sense Disambiguation

This work combines a set of available techniques – which could be further extended – to perform noun sense disambiguation. We use several unsupervised techniques (Rigau et al., 1997) that draw knowledge from a variety of sources. In addition, we also apply a supervised technique in order to show that supervised and unsupervised methods can be combined to obtain better results. This paper tries ...

Combining ConceptNet and WordNet for Word Sense Disambiguation

Knowledge-based Word sense Disambiguation (WSD) methods heavily depend on knowledge. Therefore enriching knowledge is one of the most important issues in WSD. This paper proposes a novel idea of combining WordNet and ConceptNet for WSD. First, we present a novel method to automatically disambiguate the concepts in ConceptNet; and then we enrich WordNet with large amounts of semantic relations f...

Publication date: 1997
